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Amendments to the Claims ; 

This listing of claims will replace all prior versions, and listings, of claims in the application: 
Listing of Claims : 

1. (Withdrawn) A computer-implemented method ofanalyzing a sequence of amino acids, 
comprising; 

(a) designating each amino acid wittiin the sequence with a $ymbo]> wherein 
an amino acid is designated a £rst symbol if it is a member of a predetermined set of amino 
acids, and a second symbol different from the first symbol if the amino acid is not a member 
of the predetermined set, thereby producing a sequence of symbols; 

(b) determining which signals of the symbols are present in the sequence of 
symbols^ wherein a signal is a window of the sequence of symbols consisting of a predefined 
number of contiguous symbols; wherein the sequence of amino acids is analyzed from the 
identity of the signals present in the sequence of symbols. 

2. (Withdrawn) The method of claim 1, wherein the window consists of 5-15 contiguous 
symbols. 

3. (Withdrawn) The method of claim 1, wherein the window consists of 9 contiguous 
symbols.. 
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4. (Withdrawn) The method of claim I. wherein the predetermined set of amino acids 
consists of 4-10 amino acids, and at least 4 are selected from the group consisting of A, R, Q, 
E, L,KandM, 

5. (Withdrawn) The method of claim 4, wherein the predetermined set of amino acids 
consists of A, R, Q, E, L, K and M. 

6. (Withdrawn) The method of claim 1, wherein the predetermined set of amino acids 
consists of 4-10 amino acids, and at least 4 are selected from the group consisting of C, I, L, 
M,F, W,Y,andV. 

7. (Withdrawn) The method of claim 6, wherein the predetermined set of amino acids 
consists of C. I, L, M. F, W, Y, and V. 

8. (Withdrawn) The method of claim 1 » further comprising transforming the sequence of 
symbols into a sequence of signal designations, wherein different designations are used to 
represent different signals in the sequence of symbols. 

9. (Withdrawn) The method of claim 1, wherein an amino acid is designated with a first type 
of second symbol if it is pare of a second predetermined set of amino acids, and a second type 
of second symbol if it is not part of the second set of amino acids. 
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10. (Withdrawn) The method of claim 1, wherein the signals present in the sequence of 
symbols are assigned grades according to the probability that the observed frequency of a 
signal in a collection of proteins in which each amino acid has been designated with a symbol 
occurs by chance^ wherein the grade increases with decreasing probability. 

1 1 . (Withdrawn) The metliod of claim 10, wherein the signals are classified as significant or 
not significant signals depending whether the grade exceeds a threshold. 

12. (Withdrawn) The method of claim 1 1, wherein the threshold is a ^>8 that the observed 
firequency of the signal in the collection of proteins does not occur by chance. 

13. (Withdrawn) The method of claim 12; fiirther comprising detemiining the number and 
identity of significant signals in the amino acid sequence. 

14. (Withdrawn) The method of claim 13, wherein the sequence of amino acids is a 
theoretical amino acid sequence, and the method further con:^)rises determining the 
probability that the theoretical amino acid sequence is an actual protein by comparing the 
expected number of significant signals in the theoretical amino acid sequence to the actual 
number of significant signals in die theoretical amino acid sequence. 

15. (Withdrawn) The method of claim 14, wherein the theoretical amino acid sequence is 
designated as an actual protein sequence if the probability that the observed significant 
signals in the sequence arose by chance i$ 10~'^ or less. 
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16. (Withdrawn) The method of claim 1, wherein the sequence of amino acids is from a 
known protein. 

17. (Withdrawn) The method of claim 1, wherein the sequence of amino acids is from a 
putative protein. 

18. (Withdrawn) The method of claim 1, fiirther comprising repeating steps (a) and (b) for a 
second sequence of amino acids and aligning the sequences of symbols produced from the 
first and second sequences of amino acids for maximum conservation of significant signals. 

19. (Withdrawn) The method of claim 1 1, further comprising predicting the secondary 
stmcture of a segment of a protein located within the sequence of amino acids from the 
identity of significant signals. 

20. (Withdrawn) The method of claim 19> wherein the secondary structure is selected fix>m 
the group consisting of an alpha helix, beta strand, beta turn, tum+beta, helix+tum, helix cap, 
extended helix, Gly/Pro twist, bcta+tum, helix-hairpin, beta cap, helix hairpin, beta haiipin, 
contorted helix, turn, helix+tum n and helix turn. 

21. (Withdrawn) The method of claim 1, further comprising inputting the sequence of amino 
acids into the computer. 

22. (Withdrawn) The method of claim 21 » wherein the sequence of amino acids is input by 
transfer of data from a database. 
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23. (Withdrawn) The method of claim 1, further comprising outputting the identity of 
signals present in the sequence of symbols. 

24. (Withdrawn) The method of claim 23, wherein the signals are output in an order 
corresponding to the order of amino acids in the sequence of amino acids. 

25. (Withdrawn) The method of claim I , further comprising providing user input of the 
predefined number of contiguous symbols of the window. 

26. (Withdrawn) The method of claim 10, further comprising calculating the probability that 
the observed frequency of a signal in the collection of proteins in which each amino acid has 
been designated with a symbol occurs by chance. 

27. (Witlidrawn) The method of claim 1, wherein step (b) deteraiines the identity of L-(P-1) 
signals within the sequence of amino acids, where L is length of the sequence of amino acids 
and P is the predefined number of contiguous symbols in the window. 

28. (Withdrawn) The method of claim 1 , further comprising assigning the determined 
signals designations, a different designation being used for each unique signal. 

29. (Withdrawn) The method of claim 1, further comprising aualy2dng the sequence of 
amino acids from the identity of the signals. 
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30. (Withdrawn) A computer implemented method of identifying a set of amino acids useful 
for the analysis of proteins, comprising 

(a) designating each amino acid within each of a collection of proteins with a 
symbol, wherein an amino acid is designated a first symbol if it is a member of a first test set 
of amino acids, and a second symbol different from the first symbol if the amino acid is not a 
member of the test set, thereby producing a collection of sequences of symbols; 

(b) determining the number of occurrences of different signals of the symbols 
in the collection of sequences of symbols, wherein a signal is a window of the sequence of 
symbols consisting of a predefined number of contiguous symbols; and 

(c) determining the probability that the distribution of the number of signals of 
each signal strength occurs by chance, wherein the lower the probability the more useful the 
test set of amino acids is for protein analysis. 

31. (Withdrawn) The method of claim 30. further comprising repeating steps (a), (b) and (c) 
for a second test set of amino acids. 

32. (Withdrawn) The method of claim 31, wherein the second test set differs from the first 
test set by the addition, deletion, or substitution of an amino acid from the first test set 

33. (Withdrawn) The method of claim 32, further comprising repeating steps (a), (b) and (c) 
for each possible unique set of amino acids consisting of 4-10 amino acids. 
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34. CWithdrawn) The method of claim 1, further comprising comparing the position and 
identity of each signal present in the sequence of symbols to a conserved signal pattern 
present in a family of proteins. 

35. (Withdrawn) A computer-implemented method of predicting the fold of a query protein 
compnsmg^ 

(a) designating each amino acid within a family of protein sequences with a 
symbol, wherein an amino acid is designated a first symbol if it is a member of a 
predetermined set of amino acids, and a second symbol different fix>m the first symbol if the 
amino acid is not a memba: of the set, thereby producing a plurality of sequences of symbols; 

(b) determining which signals of the symbols are present in the sequences of 
symbols, wherein a signal is a window of the sequence of symbols consisting of a predefined 
number of contiguous symbols; 

(c) determining a conserved signal pattern between members of the family; 

(d) analyzing a query protein to identify a signal pattem; 

(e) determining if the query protein's signal pattem exceeds a threshold of 
similarity to the conserved signal pattem; and 

(f) if the signal pattem of the query exceeds the threshold, designating the 
query as having the fold of the family. 

36. (Withdrawn) The mediod of claim 35, further comprising comparing the query protein's 
signal pattem to conserved signal patterns in an additional protein family. 
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37. (Withdrawn) The method of claim 36, wherein the family is selected from the list 
consisting of globins, lysozymes^ thioredoxins, trypsins, monoclonal antibodies, and amido 
transferases. 

38. (Withdrawn) Themethodof claim 35, wherein the conserved signal pattern includes a 
signal present in Table 14. 

39. (Withdrawn) The method of claim 35, wherein the conserved signal pattern includes a 
signal present in Table 15. 

Claims 40-45: (Canceled) 

46. (Withdrawn) A computer program product stored on a computer readable media for 
analyzing a sequence of amino acids, the program product comprising; 

(a) code for designating each amino acid within the sequence with a symbol, 
wherein an amino acid is designated a first symbol if it is a member of a predetermined set of 
amino acids, and a second symbol different from the first symbol if the amino acid is not a 
member of the set, thereby producing a sequence of symbols; and 

(b) code for determining which signals of the symbols are present in the 
sequence of symbols, wherein a signal is a window of the sequence of symbols consisting of a 
predefined number of contiguous symbols, wherein the sequence of amino acids is analyzed 
from the identity of the signals present in the sequence of symbols. 
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47. (Withdrawn) A computer program product stored on. a computer readable media for 
identifying a set of amino acids usefiil for the analysis of proteins, the program product 
comprising: 

(a) code for designating each amino acid within each of a collection of proteins 
with a symbol, wherein an amino acid is designated a first symbol if it is a member of a first 
test set of amino acids, and a second symbol different from the first symbol if the amino acid 
is not a member of the test set, thereby producing a collection of sequences of symbols; 

(b) code for determining the number of occurrences of different signals of the 
symbols in Ihe collection of sequences of symbols, wherein a signal is a window of the 
sequence of symbols consisting of a predefined number of contiguous symbols; and 

(c) code for determining the probability that the distribution of the number of 
signals of each signal strength occurs by chance, wherein the lower the probability the more 
usefiil the test set of amino acids is for protein analysis. 

48. (Withdrawn) A computer program product stored on a computer readable media for 
predicting the fold of a query protein, the program product comprising: 

(a) code for designating each amino acid within a family of protein sequences 
with a symbol, wherein an amino acid is designated a first symbol if it is a member of a 
predetermined set of amino acids, aiKl a second symbol different &om the first symbol if the 
amino acid is not a member of the set, thereby producing a plurality of sequences of symbols; 

(b) code for detemiining which signals of the symbols are present in the 
sequences of symbols, wherein a signal is a window of the sequence of symbols consisting of 
a predefined number of contiguous symbols; 
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(c) code for determining a conserved signal pattern between members of the 

family; 

(d) code for analyzing a query protein to identify a signal pattern; 

(e) code for determining if the query protein's signal pattern exceeds a 
threshold of similarity to the conserved signal pattern; and 

(f) code for designating the queiy as having the fold of the family if the signal 
pattern of the query exceeds the threshold. 

49. (Currently Amended) A computer program Droduct st o icd o n c omprising ! 

a computer readable media lu edimn havinp stored thereon a computer program for 
identifying a coding region of a nucleotide sequence, the profii ' am product s aid computer . 
program comprising: 

(a) code for translating all possible reading frames of a nucleotide sequmce 
into theoretical protein sequences; 

(b) code for designating each amino acid within the theoretical protein 
sequences with a symbol^ wherein an amino acid is designated a first symbol if it is a member 
of a first predetermined set of amino acids, and second symbol different fiom the first symbol 
if the amino acid is not a member of the predetermined set, thereby producing a collection of 
sequences of symbols; 

(c) c o de for dete r mining tlic number o f signiilcant signals in each ipading flame uf th e 
nucleotide sequence; and 

(d) cod e foi d e tei mining an expec t ed number o f significairt signals in each reading 

ii r ame of the uuclc ct idc sequenc e 
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(c) code for d etermininp an acrual nuTnber of said first symbols in each 
window alone each sequence of symbols, each said window compnsiTip a quantity of adiacent 
symbols in one of said sequences of symbols: 

(d) code for comparing each said actual number of first symbols for each said 
window to an expected number of first symbols and assigning a significant signal 
characteristic to said window if said actual number differs fiom said expected number by at 
least a predetermined t hreshold amount: and 

fe> code for determining an actual number of said significant signals in each 
said reading frame of said nucleotide sequence. 

50. (Withdrawn) A system for analyzing a sequence of amino acids, comprising: 

(a) a processor; and 

(b) a memory coupled to the processor configured to store a plurality of 
instructions which when executed by the processor cause the processor to: 

(i) designate each amino acid within the sequence with a symbol, 
wherein an amino acid is designated a first symbol if it is a member of a predetermined set of 
amino acids, and a second symbol different fiom the first symbol if the amino acid is not a 
member of the set, thereby producing a sequence of symbols; 

(ii) detemiine which signals of the symbols are present in the sequence 
of symbols, wherein a signal is a window of the sequence of symbols consisting of a 
predefined number of contiguous symbols^ wherein the sequence of amino acids is analyzed 
firom the identity of the signals present in the sequence of symbols. 
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5 1 . (Withdrawn) A system for identifying a set of amino acids useful for the analysis of 
proteins comprising: 

(a) a processor; and 

(b) a memory coupled to the processor configured to store a plurality of 
instructions which when executed by the processor cause the processor to: 

(i) designate each amino acid within each of a collection of proteins 
with a symbol, wherein an amino acid is designated a first symbol if it is a member of a first 
test set of amino acids, and a second symbol different from the first symbol if the amino acid 
is not amember of the test set, thereby producing a collection of sequences of symbols; 

(ii) determine the number of occurrences of different signals of the 
symbols in the collection of sequences of symbols^, wherein a signal is a window of the 
sequence of symbols consisting of a predefined number of contiguous sjnnbols; and 

(iii) detennine the probability that the distribution of the number of 
signals of each signal strength occurs by chance, wherein the lower the probability the more 
usefbl the test set of amino acids is for protein analysis. 

52. (Withdrawn) A system for predicting the fold of a query protein comprising: 

(a) a memory; 

(b) a system bus; 

(c) a processor operatively disposed to: 

(i) designate each amino acid within a family of protein sequences with 
a symboU wherein an amino acid is designated a first symbol if it is a member of a 
predetermined set of amino acids^ and a second symbol different fiom the first symbol if the 
amino acid is not a member of the set, thereby producing a plurality of sequences of symbols; 

Page 13 of 20 

PAGE 16/23 ' RCVD AT 11/30/2006 1:19:31 PM [Eastern Standard fime] ' SVR:USPT0-EF)(RF-1/1 ' DNIS:2738300'CSID: ' DURATION ^m-ss):05-56 



Noy-30-2006 OMOpin From- T-528 P. 01 7/023 F-144 



(ii) determine which signals of the symbols are present in the 
sequences of symbols, wherein a signal is a window of ihe sequence of symbols consisting of 
a predefined number of contiguous symbols; 

(iii) determine a conserved signal pattern between members of the 

family; 

(iv) analyze a query protein to identify a signal pattem; 

(v) determine if the query protein's signal pattem exceeds a threshold 
of similarity to the conserved signal pattem; and 

(vi) designate the query as having the fold of the &mily if the signal 
pattem of the query exceeds the Oure^old. 



53. (Currently Amended) A system for identifying a coding region of a nucleotide scquencQ. 
said system comprising: 

(a) a memory; 

(b) a system bus; 

(c) a processor oD cr a t iv c lv disp p scd t o which has stored thereon a computer 
program comprising : 

(i) trauAlate code for translating all possible reading frames of a 
nucleotide sequence into theoretical protein sequences; 

(ii) dc&ignatc c ode for designating each amino acid within the 
theoretical protein sequences with a symbol, wherein an amino acid is designated a first 
symbol if it is a member of a first predetermined set of amino acids, arid a second symbol 
different from the first symbol if the amino acid is not a member of the predetermined set, 
thereby producing a collection of sequences of symbols; 
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(iii) dctcimiiic the nuuibci ' of slgmliuaAit &igiial& in udLli leadiii^ Jaaxu c 
o f llic n ucle o tide seqtteii6e> and - 

(iv) dctcrmiue mi cApu^tLd munljei ' of significan t s i gnal s in each 

reading frame of the nuele^tidt scquLiicc 

fiii^ code for determining an actual number of said first symbols in 
each window along each sequence of symbols, each said window comnrising a g^ i^i^t^ ty of 
adjacent symbols in one of said sequences of symbols: 

fiy"^ code for comparinp each said actual number of first symbols for 
Pftfih ^rtri window to an expected num^^er of ft^t svynbo^ aa^fl assigning a significant signal 
characteri stic to said window if said actual number diff^ fixim said expected number bv at 
least a predetermined threshold amount: and 

(v'i code for determining m actual number of said sicmficant signals in 
each said reading firame of said nucleotide sequence . 

54 (new). A computer program product as recited in claim 49, wherein said quantity of 
adjacent symbols is in the range of firom fiye to fifteen. 

55 (new). A computer program product as recited in claim 49, wherein said computer 
program further comprises code for detemiining an expected nimiber of significant signals in 
each said reading firame of said nucleotide sequence. 
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56 (new). A computer program product as recited in claim 55, wherein said computer 
program further comprises code for comparing said actual number of significant signals in 
each said reading frame of said nucleotide sequence with said expected number of significant 
signals in each said reading firame of said nucleotide sequence. 

57 (new). A system as recited in claim 53, wherein said quantity of adjacent symbols is in the 
range of from five to fifteen. 

58 (new). A system as recited in claim 53» wherein said computer program finther comrpiises 
code for detennining an expected number of significant signals in each said reading fi-ame of 
said nucleotide sequence. 

59 (new). A system as recited in claim 58, wherein said computer program fiirther comprises 
code for comparing said actual number of significant signals in each said reading firame of 
said nucleotide sequence with said expected number of significant signals in each said 
reading frame of said nucleotide sequence. 



Page 16 of 20 



PAGE 1M23 * RCVD AT 11/30/2006 1:19:31 PM [Eastern Standard Tone]* SVR:USPT0-EFXRF-t/1 ' DNIS:2738300* CSID: * DURATION ^m-ss):0S-5( 



